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Abstract 

In this paper we explore maximal deviations of large random structures from their typical 
behavior. We introduce a model for a high-dimensional random graph process and ask analogous 
questions to those of Vapnik and Chervonenkis for deviations of averages: how “rich” does the 
process have to be so that one sees atypical behavior. 

In particular, we study a natural process of Erdos-Renyi random graphs indexed by unit 
vectors in We investigate the deviations of the process with respect to three fundamental 
properties: clique number, chromatic number, and connectivity. In all cases we establish upper 
and lower bounds for the minimal dimension d that guarantees the existence of “exceptional 
directions” in which the random graph behaves atypically with respect to the property. For 
each of the three properties, four theorems are established, to describe upper and lower bounds 
for the threshold dimension in the subcritical and supercritical regimes. 


1 Introduction 

One of the principal problems in probability and statistics is the understanding of maximal devia¬ 
tions of averages from their means. The revolutionary work of Vapnik and Chervonenkis [I9l[20l[2l] 
introduced a completely new combinatorial approach that opened many paths and helped us un¬ 
derstand this fundamental phenomena. Today, the Vapnik-Chervonenkis theory has become the 
theoretical basis of statistical machine learning, empirical process theory, and has applications in 
a diverse array of fields. 

The purpose of this paper is to initiate the exploration of maximal deviations of complex random 
structures from their typical behavior. We introduce a model for a high-dimensional random graph 
process and ask analogous questions to those of Vapnik and Chervonenkis for deviations of averages: 
how “rich” does the process have to be so that one sees atypical behavior. In particular, we study 
a process of Erdos-Renyi random graphs. In the G{n,p) model introduced by Erdos-Renyi [H [7], a 
graph on n vertices is obtained by connecting each pair of vertices with probability p, independently, 
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at random. The G{n,p) model has been thoroughly studied and many of its properties are well 
understood—see, e.g., the monographs of Bollobas [1] and Janson, Luczak, and Rucihski [9]. 

In this paper we introduce a random graph process indexed by unit vectors in defined as 
follows. For positive integer n, write [n] = {!,... ,n}. For 1 < i < j < n, let Xjj be independent 
standard normal vectors in Denote by the collection of these random 

points. For each s G (where denotes the unit sphere in M'^) and t G M we define the 

random graph r(X„, s, t) with vertex set u(r(X„, s, t)) = [n] and edge set e(F(X„, s, t)) = {{i, j} : 
{Xij, s) > t}, where (•, •) denotes the usual inner product in 

For any fixed s G and f G M, T{Xn, s,t) is distributed as an Erdos-Renyi random graph 
G{n,p), with p = 1 — <k(t) where ‘k is the distribution function of a standard normal random 
variable. In particular, r(X„,s,0) is a G(n, 1/2) random graph. With a slight abuse of notation, 
we write F(X„, s) for r(X„, s, 0). 

We study the random graph process 

Gd,piXn) = {r(X„,s,^>-ni -p)) : S G . 

Gd,p{Xn) is a stationary process of G{n,p) random graphs, indexed by d-dimensional unit vectors. 
For larger values of d, the process becomes “richer”. Our aim is to explore how large the dimension 
d needs to be for there to exist random directions s for which r(X„,s,$“^(l — p)) G Gd,p{Xn) 
has different behavior from what is expected from a G{n,p) random graph. Adapting terminology 
from dynamical percolation jlHj, we call such directions exceptional rotations. More precisely, in 
analogy with the Vapnik-Chervonenkis theory of studying atypical deviations of averages from 
their means, our aim is to develop a VC theory of random graphs. In particular, we study three 
fundamental properties of the graphs in the family the size of the largest clique, the 

chromatic number, and connectivity. In the first two cases we consider p = 1/2 while in the study 
of connectivity we focus on the case when p = clogn/n for some constant c > 0. 

The graph properties we consider are all monotone, so have a critical probability p* at which 
they are typically obtained by G{n,p). For example, consider connectivity, and suppose we first 
place ourselves above the critical probability in G{n,p), e.g., p = clogn/n for c > 1, so that G{n,p) 
is with high probability connected. Then the question is how large should d be to ensure that 
for some member graph in the class, the property (connectivity) disappears. There is a threshold 
dimension d for this, and we develop upper and lower bounds for that dimension. Secondly, consider 
the regime below the critical probability for connectivity in G{n,p), e.g., p = clogn/n for c < 1. 
In this case, with high probability G{n,p) is not connected, and we ask how large d should be to 
ensure that for some member graph in the class, the property (connectivity) appears. Again, we 
develop upper and lower bounds for the threshold dimension d for this. 

In all, for each of the three properties considered in this paper, clique number, chromatic number, 
and connectivity, four theorems are needed, to describe upper and lower bounds for the threshold 
dimension for exceptional behaviour in the subcritical regime (when the property typically does 
not obtain) and in the supercritical regime (when the property typically does obtain). In every 
case, our results reveal a remarkable asymmetry between “upper” and “lower” deviations relative 
to this threshold. 

Our techniques combine some of the essential notions introduced by Vapnik and Chervonenkis 
(such as shattering, covering, packing, and symmetrization), with elements of high-dimensional 
random geometry, coupled with sharp estimates for certain random graph parameters. 
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The model considered in this paper uses subsets of the collection of halfspaces in to define 
the random graphs in the collection. A natural variant would be one in which we associate with 
each edge {i,j} a uniformly distributed random vector on the torus [0,1]'^, and consider a class 
parametrized by s G [0, l]*^. Then define the edge set e(r(JCn, s, t)) = {{i,j} ■ W^ij — ■s|| < t}. For 
general classes of sets of the complexity of the classes will affect the behaviour of the collection 
of random graphs in a universal manner. We can define the complexity of a class of graphs indexed 
in terms of the threshold dimension needed to make certain graph properties appear or disappear 
in the subcritical and supercritical regimes, respectively. It will be interesting to explore the 
relationship between the combinatorial geometry of the class and these complexities. 

Note that when d = 1, Gi^p{Xn) only contains two graphs (whenp = 1/2, one is the complement 
of the other), and therefore the class is trivial. On the other extreme, when d> (2) , with probability 
one, the collection contains all 2 ( 2 ) graphs on n vertices. This follows from the following 

classical result on the “VC shatter coefficient” of linear half spaces (see, e.g., Schlaffli m, Cover 
0 ) that determines the number of different graphs in i/ 2 (Aln) (with probability one). 


Lemma 1.1. Given N > d points xi,... ,xn G in general position (i.e., every subset ofd points 
is linearly independent), the number of binary vectors b G {0, l}'^ of the form b = (ij^j,, ,,^> 0 })^^^ 

for some s G S'^~^ equals 


d-l 


C{N,d) = 2Y, 


k=0 


N -I 
k 


In particular, when N = d, all 2^ possible dichotomies of the N points are realizable by some 
linear half space with the origin on its boundary. In such a case we say that the N points are 
shattered by half spaces. 


Notation and Overview. Throughout the paper, log denotes natural logarithm. For a sequence 
{An} of events, we say that An holds with high probability if lim„_,.oo 1P{A„} = 1. 

The paper is organized as follows. In Section]^ we study the clique number in the case p = 1/2. 
The four parts of Theorem 2.1 establish upper and lower bounds for the critical dimension above 
which, with high probability, there exist graphs in G^ i/ 2 (A^n) whose largest clique is significantly 
larger/smaller than the typical value, which is ~ 2 log 2 n — 2 log 2 log 2 n. We show that the critical 
dimension for which some graphs in G^^i/ 2 (A^n) have a clique number at least, say, 101 og 2 n is of 
the order of log^ n/ log log re. 

In sharp contrast to this, d needs to be at least n^/polylog n to find a graph in Grf^i/ 2 (Aln) 
with maximum clique size 3 less than the typical value. We study this functional in Section 
Theorem 3.1 summarizes the four statements corresponding to upper and lower bounds in the sub-, 
and super-critical regime. Once again, the two regimes exhibit an important asymmetry. While no 
graphs in G^^ i/ 2 (-^n) have a chromatic number a constant factor larger than typical unless d is is 
of the order of re^/ polylogn, there exist graphs with a constant factor smaller chromatic number 
for d near re. 

Finally, in Section]^ connectivity properties are examined. To this end, we place ourselves in 
the regime p = clog re/re for some constant c. When c < 1, a typical graph G{n,p) is disconnected, 
with high probability, while for c > 1 it is connected. In Theorem 4.1 we address both cases. 
We show that for c > 1, the critical dimension above which one finds disconnected graphs among 
G^ ciogn/n(^n) IS of the Order of log re/log log re. (Our upper and lower bounds differ by a factor 
of 2.) We also show that when c < I, d needs to be at least roughly re^“‘^ in order to find a 
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connected graph ciogn/n(^n)- While we conjecture this lower bound to be sharp, we do not 
have a matching upper bound in this case. However, we are able to show that when d is at least 
of the order of nyjlogn, ciogn/n(-X^n) not only contains some connected graphs but with high 
probability, for any spanning tree, there exists s G such that r(X„, s, t) contains that spanning 
tree. This property holds for even much smaller values of p. 

In the Appendix we gather some technical estimates required for the proofs. 


2 Clique number 

In this section we consider p = 1/2 and investigate the extremes of the clique number amongst the 
graphs r(W„, s), s G Denote by cl{Xn, s) the size of the largest clique in r(X„, s). 

The typical behavior of the clique number of a G(n, 1/2) random graph is quite accurately 
described by Matula’s classical theorem [12] that states that for any fixed s G for any e > 0, 

cl{Xn,s) G {[ca - ej, [w + ej} 

with probability tending to 1, where w = 2 log 2 n — 2 log 2 log 2 n + 2 log 2 e — 1. 

Here we are interested in understanding the values of d for which graphs with atypical clique 
number appear. We prove below that while for moderately large values of d some graphs s) 

have a significantly larger clique number than cj, one does not find graphs with significantly smaller 
clique number unless d is nearly qua dratic in n. 

Observe first that by Lemma 1.1 for any k, il d> ( 2 ), then, with probability one, cl{Xn, s) > k 
for some s G (Just fix any set of k vertices; all 2^ graphs on these vertices is present for 

some s, including the complete graph.) For example, when d rsj (9/2)(log2 n)2, cl{Xn,s) > 3 log 2 n 
for some s G a quite atypical behavior. In fact, with a more careful argument we show 

below that when d is a sufficiently large constant multiple of (log n)^/log log n, then, with high 
probability, there exists s G such that d{Xn, s) > 31og2 n. We also show that no such s exists 
for d = o((logn)^/loglogn). Perhaps more surprisingly, clique numbers significantly smaller than 
the typical value only appear for huge values of d. The next theorem shows the surprising fact that 
in order to have that for some s G cl{Xn,s) < w — 3, the dimension needs to be 

(Recall that for d = ( 2 ) the point set Xn is shattered and one even has cl{Xn, s) = 1 for some s. 

Our findings on the clique number are summarized in the following theorem. 

Theorem 2.1. (clique number.) If cl{Xn, s) denotes the clique number ofT{Xn,s), then, with 
high probability the following hold: 

(i) (subcritical; necessary.) Ifd = o(n^/(logn)9), then for all s G , cl{Xn,s) > oj-2> . 

(a) (subcritical; sufficient.) Ifd > ( 2 ), then there exists s G S'^~^ such that cl{Xn,s) = 1 . 

(in) (supercritical; necessary.) For any c > 2 there exists c' > 0 such that ifd < c' log^ n/log log n, 
then for all s G we have cl{Xms) < clog 2 n. 

(iv) (supercritical; sufficient.) For any c> 2 and d > c^/(21og2), if d> d log^ n/log log n, 
then there exists s G 5"^“^ such that cl{Xn,s) > clog 2 n . 

The event described in (ii) holds with probability one for all n. 
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Proof. To prove part (i), let k = [a; — 3j and let Nk{s) denote the number of cliques of size k in 
r(X,i, s). Let rj G (0,1] and let Cr^ be a minimal r/-cover of 5"^“^. Then 

P|3s E : Nk{s) = O} 

= P E Cri and 3s E 5"^“^ : ||s — s'|| < r] : Nk{s) = o| 

< |C,;|P jds E 5"^“^ : ||s — Soil < r/ : Nk{s) = o| 


where 5 o = (1,0,..., 0) and the last inequality follows from the union bound. Consider the graph 
r(-X^ri; So, — r]‘^f2) in which vertex i and vertex j are connected if and only if the first com¬ 

ponent of Xij is at least — 77^ /1 — r ?^/2 


The proof of Lemma 5.3 implies that the event |3s E S’'^ ^ : ||s — so|| < rj : Nj^{s) = 0| is in¬ 


cluded in the event that r(X„, sq, — r/y^l — 'q‘^/2) does not have any clique of size k. By Lemma 
5.3t the probability of this is bounded by the probability that an Erdds-Renyi random graph 
G{n, 1/2 — an) does not have any clique of size k where an = If we choose (say) rj = \/r? 

then for d < n? we have Un < 1/n and therefore, by Lemma 5.7 below, 




3s E S' 


d-l 


\s - Sol 


< V ■■ = 0 } < exp ( 


for some numerical constant C. Thus, using Lemma 5.1 

P {ae e 5“-' : iV»(e) = o} < (4„2)‘‘exp = »(1) 


1.1 


with d 


whenever d = o(n^/(logn)®). 

Part (ii) follows from the simple fact that, by Lemma 
appears among the r(X„, s). 

The proof of part (iii) proceeds similarly to that of part (i). Let k = 


= ( 2 ) even the empty graph 
clog 2 n. Then 


P|3s E : Nk{s) > 1 } 

< |Cj 7 |P |3s E : ||s — Soil < 7/ : Nk{s) > l| . 


Similarly to the argument of (i), we note that the event {3s E S”^ ^ “ •soil < : -^fc(s) > l} is 

included in the event that T{Xn,SQ,—r]yJY^^rf^) has a clique of size k, which is bounded by 
the probability that an Erdds-Renyi random graph G{n, 1/2 -|- an) has a clique of size k where 
an = Denoting p = 1/2 -|- this probability is bounded by < {np^G'^_ We may 

choose 77 = 4/(i. Then, for d sufficiently large, an < (c/2 — 1) log 2 and, using Lemma |5.1[ we have 


P{3s E : iVfc(s) > 1 } < 

< 

< 

< 


(4/77)'^ (77p("/2)log2ny^°S2>^ 

l-h(c/2)log2(l/2-ha„)y'°*^2« 

l-c/2-|-ca„/ log2^ "■ 

gdlogd^(l-c/2)c(log2 n)/2 
gdlog d-(c-2)c(log2 n)2 log 2/4 


gdlogd 

gdlogd 
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and the statement follows. 


It remains to prove part (iv). The proof relies on the second moment method. Let c > 2, c/ > 
c^/(21og2), and assume that d > c'log^ n/loglogn. Let iL be a constant satisfying K > Ij^/d 
and define 6 = K \/log log n / log re. Let A be a subset of S‘^~^ of cardinality at least 
such that for all distinct pairs s, s' G ^4, we have (s, s') > cos(0). Such a set exists by Lemma 
Also, let C be the family of all subsets of [n] of cardinality k = [clog 2 nJ. For s G A and 7 G C, 
denote by the indicator that all edges between vertices in 7 are present in the graph r(X„, s). 

Our aim is to show that lim„_,.oo P{Z > 0} = 1 where 


5.2 


“ X/ X/ 

seA-yeC 


To this end, by the second moment method (see, e.g., HI), it suffices to prove that EZ —?• 00 and 
that E[Z^] = (EZ)2(1 + o(1). 

To bound EZ note that 


EZ = \A\(^^EZs,^ 

= exp((logn)^(c'-^ + j^+<,(l)))^oo 

On the other hand. 


E[Z^] 


def 


s,s'eA-y,yec 

E E E[Zg^jZs/^y] + EE E[Zg^^Zg^y] 

s^s':s^s' 7,7^GC:|7n7^|<l sGA 7,7^gC 

s,s':sj^s'£A 7,7'eC:|7n7'|>2 

/ + // + ///. 


For the first term note that if 7 and 7 ' intersect in at most one vertex then Zg^^y and Zg/^y are 
independent and therefore 

'= E E EZg^^^Zg!yy! < {EZ)‘^ . 

s,s'-.syts'&A 7,7'eC:|7n7'|<l 
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Hence, it suffices to prove that II + III = o((EZ)^). To deal with II, we have 

k 


II 


1 


(EZ)2 






n — k\ fk 


< 


1 

Pil 


i=0 




- 

< ^ 2 ( 2 )^ 
£=0 


(n - 2A:)<^£! 
2 \ ^ 


i^r 




n — 2k J £l 


1 


= exp (-(logn)^ (c + 0 ( 1 ) - c^/(21og2))) 
We now take care of III. To this end, we bound 


0 . 


max ElZs^jZ^/^y] 

s,s'gA:s^s' 

7,7':|7n7'|=£ 


by 


2(2)-2Q)+1i 


N 


,so)> sin(6'/2) 


(D 


where is a standard normal vector in To see this, note that 2 ( 2 ) — ( 2 ) edges of the two 
cliques occur independently, each with probability 1/2. The remaining ( 2 ) edges must be in both 
r(X„,s) and r(X„,s'). A moment of thought reveals that this probability is bounded by the 
probability that the angle between a random normal vector and a hxed unit vector (say sq) is less 
than 7r/2 — 0/2. This probability may be bounded as 

E{(lV/||lV||,so) >sm(0/2)} = {B > sm^(d/2)} 


< 


(where H is a Beta(l/2, (d — l)/2) random variable) 
EB 


2 sin 70 / 2 ) 

1 


2 dsin 70 / 2 ) 

2 + 0 ( 1 ) 2 + 0 ( 1 ) 


d 02 


'K2 


Via the same counting argument used in handling II, we have 

2 - I r'R'2 


III 


(EZ) 


e=2 




n 


1 


Since c' > 4, we have, for n large enough, 

k / T 2 \ (- 


III 


{EZ) 


2 s 


E 

t=2 


k^ 


n 


-2k £! 


= O 


-2kJ £! 


(log n) 




as required. This concludes the proof of the theorem. 


□ 


We conclude the section by remarking that the above proof extends straightforwardly to G{n,p) 
for any constant p £ (0,1). 
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3 Chromatic number 


A proper coloring of vertices of a graph assigns a color to each vertex such that no pair of vertices 
joined by an edge share the same color. The chromatic number x{G) of a graph G is the smallest 
number of colors for which a proper coloring of the graph exists. 

Here we study the fluctuations of the chromatic numbers x(r(X„, s)) from its typical behavior 
as s G Once again, for simplicity of the presentation, we consider p = 1/2. The arguments 

extend easily to other (constant) values of p. 

For a fixed s, a celebrated result of Bollobas [H] implies that 


n 

2 log 2 n 


<x(r(x„,s)) < 


n 

2 log 2 n 


(1 + 0 ( 1 )) 


with high probability. 

In this section we derive estimates for the value of the dimension d for which there exist random 
graphs in the collection Gdp/ 2 {^n) whose chromatic number differs substantially (i.e., by a constant 
factor) from that of a typical G(n, 1/2) graph. Similar to the case of the clique number studied in 
Section we find that upper and lower deviations exhibit a different behavior—though in a less 
dramatic way. With high probability, one does not see a graph with a clique number larger than 
(1 + e)n/(21og2n) unless d is at least / polylog n. On the other hand, when d is roughly linear 
in n, there are graphs is i/ 2 (Aln) with chromatic number at most (1 — e)n/(21og2 n). Below we 
make these statements rigorous and also show that they are essentially tight. 


Theorem 3.1. (chromatic number.) Let e G (0,1/2). //x(r(X„,s)) denotes the chromatic 
number ofT{Xn,s), then, with high probability the following hold: 


(i) (subcritical; necessary.) If d = o(n/(logn)^), then for all s G 5^“^ x(r(X„,s)) > 
(1 - e)n/(21og2n). 

(a) (subcritical; sufficient.) If d > 2nlog9n/(l — 2e), then there exists s G such that 
xi'I'iXn, s)) < (1 - e)n/ (2 log2 n). 

(Hi) (supercritical; necessary.) If d = o(n^/(logn)®), then for all s G 5”^“^ x(r(X„,s)) < 
(1 + e)n/(21og2n). 

(iv) (supercritical; sufficient.) If d > .5 [(1 + e)n/(21og2n)]^, then there exists s G 
such that x(r(X„, s)) > (1 + e)n/(2 log2 n). 


Part (i) of Theorem 3.1 follows from the following “uniform concentration” argument. 
Proposition 3.2. If d = o{n/{logn)^), we have 


sup 

seS**-! 


x(r(x„,s))- 


n 


— Or} 


n 


log2n 


2 log 2 n 

Proof. A classical result of Shamir and Spencer|17j shows that for any fixed s G 

|x(r(X„, s)) - E (x(r(X„, s)))\ = . 









In fact, one may easily combine the above-mentioned results of Bollobas and Shamir and Spencer 
to obtain that 

Ey(r(X„,s)) ^ ^ 

n/( 21 og 2 n) 

The proof of the proposition is based on combining the Shamir-Spencer concentration argument 
with Vapnik-Chervonenkis-style symmetrization. 

For each s G and i = 2,...,re, define Yi^g = (l{(Xi^ {0; as the 

collection of indicators of edges connecting vertex i smaller-labeled vertices in s). As Shamir 

and Spencer, we consider the chromatic number s) as a function of these variables and define 

the function / : nr= 2 { 0 > 1}®“^ —> N by 

f{Y2,s,...,Yn,s) = X{nXn,s)) . 

By Markov’s inequality, it suffices to show that 


E 


sup 1 /(^ 2 ,*, . . . , - E/(y 2 ,., . . . , Yn,s 

seS'i-i 


= o 


re 


log re 


Let X'j^ = {XY)i<i<j<n be an independent copy of Denote by E' conditional expectation 
given Xn- We write ^ {0,1?"^- 

Also introduce random “swap operators” 62 , • • •, Cn defined by 

.(Y. vM - / probability 1/2 

i,s, j,sJ I y-/^ with probability 1/2 


where the e* are independent of each other and of everything else. 


E 


sup 1/(^2,., . . . , Yn,s) - E/(y2,s, . . . , 


= E 


< E 


= E 


sup |E' {f{Y 2 ,s, . . . , Yn,s) - fiYlg ,..., y;j) I 


sup \f{Y2,s,...,Y^,s)-f{Ylg,...,Y',s)\ 


sup |/(e 2 (y 2 ,s, YQ, eniY^,s, Y'^g)) - fie 2 {Yl,, Y 2 ,g ),..., y„,.)) | 


Introduce now the expectation operator Eg that computes expectation with respect to the random 
swaps only. Then we can further bound the expectation above by 


2EEg 


sup |/(e 2 (y 2 ,s,y 2 ,, 

seS**-! 


),..., en{Yn,gX,s)) - ^Jie2{Y2,g, Ylg ),..., e.(y„,„ y;j)| 


Next we bound the inner expectation. Note that for hxed Xn, X'n, by Lemma [l.l| there are at most 
different dichotomies of the 2 ( 2 ) points in Xn U X'n by hyperplanes including the origin and 
therefore there are not more than re^“ random variables of the form f{e 2 {Y 2 ,g, ^"2 «))•••) ^n{Yn,s, Yn g)) 
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as s varies over On the other hand, for any fixed s, the value of f{e 2 {Y 2 ,s, ^2 s)^ • • • j ^n{Yn,s,Yh «)) 

can change by at most 1 if one flips the value of one of the €i{Yi^s,Y- g) (i = 2 ,..., n), since such 
a flip amounts to changing the edges incident to vertex i and therefore can change the value of 
the chromatic number by at most one. Thus, by the bounded differences inequality (see, e.g., [H 
Section 6.1]), for all s G and A > 0, 


E, [exp {X{f{e 2 {Y 2 ,s, T'J,..., - E,/(e 2 (T 2 ,„ • • •, e„(Tn,., K,s))))] 

(n — 1)A^\ 


< exp 


Therefore, by a standard maximal inequality for sub-Gaussian random variables (O Section 2.5]), 

E, sup |/(e 2 (r 2 ,„y 2 ',J, ■ ■ .,eniYn,sX,s)) ” E,/(e 2 (^ 2 ,., ^ 2 , J, • • ■,eniYn,sX,s))\ 

< \/4:{n — l)dlogn . 

Since the upper bound is o(n/ logn) for d = o{n/ log^ n), the result follows. □ 

Parts (ii) and (iv) of Theorem |3.1| follow from the next, straingforward proposition by setting 
k= [(1 — e)n /(2 log 2 n)J and k! = [(1 + e)n /(2 log 2 n)]. 

Proposition 3.3. Let k,k' < n be positive integers. If d > then, with probability one, 

there exists s G S'^~^ such that x(r(X„,s)) < k. On the other hand, if d > ( 2 ), then, with 
probability one, there exists s G 5’'^“^ such that y;(r(X„, s)) > k'. 

Proof. Partition the vertex set [n] into k disjoint sets of size at most [n/fe] each. If for some 
s G each of these sets is an independent set (i.e., contain no edge joining two vertices within 
the set) in r(AC„,s), then the graph r(AC„,s) is clearly properly colorable with k colors. Let A 
be the set o f pairs of vertices (i,j) such that i and j belong to the same set of the partition. By 
Lemma 


1.1 


if d > > j^dj, the set of points {Xij : {i,j) G A} is shattered by half spaces. In 

particular, there exists an s G S'^~^ such that {Xi^j, s) < 0 for all {i,j) G A and therefore r(Xn, s) 
has no edge between any two vertices in the same set. The first statement follows. 

To prove the second statement, simply notice that is a gr aph has a clique of size k then its 
chromatic number at least k. But if d > (2), then, by Lemma 1.1 1 for some s G 5"^“^, the vertex 
set { 1 ,... ,k} forms a clique. □ 


It remains to prove Part (iii) of Theorem 3.1 To this end, we combine the covering argument 


used in parts (i) and (iii) of Theorem 2.1 with a result of Alon and Sudakov |2] (see Proposition 


5.8 below) that bounds the “resilience” of the chromatic number of a random graph. 

Let Cri be a minimal rf-cavev of 5"^“^ where we take r] = ce^/(\/dlog^ n) for a sufficiently small 
positive constant c. Then 


3sG5''-i :x(r(X„,s)) > (1 + e) 


n 


2 log 2 n 


< 1C,,1P jds G : ll5 - soil < V ■■ xXXn, s)) > (1 + 
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where so = (l)0)---)0). By the argument used in the proof of parts (i) and (iii) of Theorem 2.1 


U r(x„,s) cr(x„,so)u.E 

so||<5? 

where iii is a set of Bin((2),an) edges where, an = By our choice of rj, we have an < 

j (log2 n)^ where C2 is the constant appearing in Proposition 5.8 Thus, by the Chernoff 
bound, 

C2(log2 — l/2)e^n^' 


|.E| > 


C2^V? 

(logs 


< exp — 


(logs 


Hence, by Proposition 5.8 


LsG : ||s - soil < r? : x(r(X„, s)) > (1 + 

I 2 logs n J 

ClTl? 


< exp — 


C2(log2 — l/2)e^n^ 


(log2 n) 


+ exp - 


(logs n) 


Combining this bound with Lemma 5.1 implies the statement. 


□ 


4 Connectivity 

In this section we study connectivity of the random graphs in Gd,p{Xn)- It is well known since the 
pioneering work of Erdos and Renyi [7] that the threshold for connectivity for a G{n,p) random 
graph is when p = clogn/n. For c < 1, the graph is disconnected and for c > 1 it is connected, with 
high probability. In this section we investigate both regimes. In particular, for c > I we establish 
lower and upper bounds for the smallest dimension d such that some graph in Ga^dogn/ni^n) is 
disconnected. We prove that this value of d is of the order of (c— 1) log n/log log n. For the regime 
c < 1 we also establish lower and upper bounds for the smallest dimension d such that some graph 
in Gd^ciogn/ni^n) is Connected. As in the case of the clique number and chromatic number, here 
as well we observe a large degree of asymmetry. In order to witness some connected graphs in 
Gd,c\ogn/ni^n), the dimension d has to be at least of the order of n^~^. While we suspect that 
this bound is essentially tight, we do not have a matching upper bound. However, we are able to 
show that when d is of the order of nlogn, the family „,iogn/n(^n) not only contains connected 
graphs, but also, with high probability, for every spanning tree of the vertices [n], there exists an 
s G S'^~^ such that r(X„, s, t) contains the spanning tree. (Recall that t is such that p = I — ‘h(t).) 

Theorem 4.1. (connectivity.) Assume p = clogn/n and let t = 4>“^(1 — p). Then with high 
probability the following hold: 

(i) (subcritical; necessary.) If c < I then for any e G (0,c), if d = 0{n^~^~^), then for all 
s G T{Xn, s,t) is disconnected. 

( a ) (subcritical; sufficient.) There exists an absolute constant C such that if d > Cn^/logn, 
then there exists an s ^ S'^~^ such that T{Xn, s,t) is connected. 

(iii) (supercritical; necessary.) Ifc > 1 then for any e > 0, ifd < (1 —e)(c—I) logn/loglogn, 
then for all s G T{Xn, s,t) is connected. 


II 












(iv) (supercritical; sufficient.) Ifc > 1 then for any e > 0, ifd > (2+e)(c—1) logn/loglogn, 
then for some s G r{Xn, s,t) is disconnected. 

Proof of Theorem |4.1| , part (i). 

To prove part (i), we show that when d = with high probability, all graphs r(X„, s,t) 

contain at least one isolated point. The proof of this is based on a covering argument similar those 
used in parts of Theorems 2.1 and |3.1t combined with a sharp estimate for the probability that 


G{n,clogn/n) has no isolated vertex. This estimate, given in Lemma 5.9 below, is proved by an 
elegant argument of O’Connell m- 

Let r] G (0,1] to be specified below and let be a minimal rj-cover of S''’* If N{s) denotes 
the number of isolated vertices (i.e., vertices of degree 0) in r(X,i, s,t), then 


G S'’* ^ : r(X„,s,t) is connected| 
< P{3s G S'^-^ : A^(s) = o} 


|C,|P{ 


3s G S' 


d-l 


■ P — Soil < V ■ = o| 


where sq = (1,0,..., 0). It follows by the first half of Lemma 5.4 that there exists a constant k > 0 
such that if 7] = Ke/{ty/d), then 

p|3sGS'^-^ : ||s-so|| < r/: A^fc(s) = o| <P{iV = 0} 

whe re N is the number of isolated vertices in a G{n, (c + e/2) logn/n) random graph. By Lem ma 

and 


5.9 


for n sufficiently large, this is at most exp(—n Bounding |C,j| by Lemma 


substituting the chosen value of rj proves part (i). 


5.1 


Proof of Theorem 4.1, part (ii) 


Part (ii) of Theorem 4.1 follows from a significantly more general statement. Based on a geometrical 
argument, we show that for any positive integer k, if d is at least a sufficiently large constant 
multiple of fc<I>~^(l — p), then with high probability, k independent standard normal vectors in 
M'’* are shattered by half spaces of the form {x : {x,s) > t}. In particular, by taking k = n — 1 
and considering the normal vectors Xij corresponding to the edges of any fixed spanning tree, 
one finds an s G 5'^“^ such that T{Xn, s,t) contains all edges of the spanning tree, making the 
graph connected. Note that if d > Gny/a logn then the same statement holds whenever p = n~^ 
regardless of how large a is. Thus, for d ;:g> n-^logre, some r(X„,s,t) are connected, even though 
for a typical s, the graph is empty with high probability. 

Fix a set E of edges of the complete graph iL„. We say that Gd,p{Xn) shatters E if {e{G) : 
G G Gd^p(X„)} shatters E (where e{G) denotes the set of edges of a graph G). In other words, 
Gd,p{Xn) shatters E if for all F C E there is G G Gd,p{Xn) such that e{G) (1 E = F. 


Proposition 4.2. Fix n G N, A: G {1, 2,..., ( 2 )}, and a set E = {ei,...,efc} of edges of the 
complete graph K^. There exist universal constants b,c> 0 such that for d > (4/c) • k ■ 4>~^(1 — p) 
we have 


P (Gd,p(^n) shatters E) >1 — e *"’* . 
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Proof. Given points xi,..., in the affine span of xi,..., x^ is the set Ci = 

1}. Fix E = {ei,..., Cfc} G and let Pe be the affine span of Xe ^, • ■ ■, Xe,.. Also, let t = <1>“^(1—p). 

First suppose that min{||y|| : y G Pe} > t. Then we may shatter E as follows. First, almost 
surely, Pe is a (A: — l)-dimensional affine subspace in Assuming this occurs, then E is shattered 
by halfspaces in Pe'- in other words, for any E <Z E there is a (A — 2)-dimensional subspace H 
contained within Pe such that E and E \ F lie on opposite sides of H in Pe (i.e., in different 
connected components of Pe \ H). 

Fix F C E and H C Pe as in the preceding paragraph. Then let K he a {d — l)-dimensional 
hyperplane tangent to = {x G : ||x|| = t}, intersecting Pe only at H, and separating the 

origin from F. In other words, K is such that K H Pe = H and \K n = 1, and also such 

that 0 and F lie on opposite sides of K of \ K. Since Pe has dimension k — 1 < d — 2, such a 
hyperplane K exists. Since F and E\F lie on opposite sides of H, we also obtain that 0 and E\F 
lie on the same side of K. 

Let s G S'^~^ be such that ts G K. Then for e G F we have {Xg, s) > t, and for e G F \ F we 

have {Xe, s) < t. It follows that F n r(X, s, t) = F. Since F C F was arbitrary, this implies that 

F{Gd,p{Xn) shatters F) > P(min{||y|| : y G Pe] > ^>“^1 - P)) > 

In light of the assumption that d > (4/c) • k ■ — p), the proposition is then immediate from 

Lemma 14.31 below. □ 


The key element of the proof of Proposition 4.2 is that the affine span of k < Ad independent 
standard normal vectors in is at least at distance of the order oi d/k from the origin. This is 
made precise in the following lemma whose proof crucially uses a sharp estimate for the smallest 
singular value of a d x k Wishart matrix, due to Rudelson and Vershynin [Si- 


Lemma 4.3. There exist universal eonstants 6, c > 0 such that the following holds. Let Ni,..., 
be independent standard normal vectors in let P he the affine span of Ni,..., Nk, and let 
D = min{||y|| : y G P}. Then whenever d > Ak, we have F{D < cd/Ak) < 2e~^‘^. 

Proof. We use the notation y = (pi,..., p^). We have 


D = min 

y.T,yi=^ 




2 = 1 


= mm I y \ 
v.T,yi=^ 


E 

2 = 1 


Vi 

\\y\ 


N, 


1 

> min — 
y.J2yi=^ ^ 


1 

> — min 

k y.\y\2 = l 



2 


where the first inequality holds because if Vi = ^ then ||p|p > A: ^ and the second by noting 
that the vector (pj/||p||, 1 < i < k) has 2 -norm 1 . 
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Let N be the d x k matrix with columns N^,..., Nj^, and write N = (-/Vij)ijg[d]x[fc] 


. Then 


mm 

y :|y|2 = l 




2 = 1 


min 

y:|y|2 = l 




2=1 


min ||Xy|| 

y :|y|2 = l 


The final quantity is just the square of the least singular value of X. Theorem 1.1 of Rudelson and 
Vershynin m states the existence of absolute constants b,B > 0 such that for every e > 0 we have 

pf min ||Xy|| < e(Vd-+e"“. 

\y-\y\=^ J 

If d > 4(/c — 1) then \/d — > ^/d/2 and d — k + 1 > d. Combining the preceding probability 

bound with the lower bound on D,ii £ < e~^/B we then obtain 


D < e^— 1 < 2e“ 
4A:, 


Taking c = (e ^/B)'^ completes the proof. 


□ 


One may now easily use Proposition 4.2 to deduce part (ii) of Theorem 4.1 


Proposition 4.4. There are absolute constants b,C > 0 such that the following holds. For all 
p < 1/2, if d> Cn^ylog{l/p) then with probability at least 1 — there exists s G such that 
r(X, s, 4>“^(1 — p)) is connected. 


Proof. Fix any tree T with vertices [n], and write E for the edge set of T. By Proposition 4.2, if 
d > (4/c)-A;-<I>“^(l—p) then with probability at least 1 —there is s such that r(X, s, 4>“^(1—p)) 
contains T, so in particular is connected. Now simply observe that for p < 1/2 we have $“^(1—p) < 
V 21 og(l/p). □ 


Observe that the exponentially small failure probability stipulated in Proposition |4.4| allows us 
to conclude that if d is at least a sufficiently large constant multiple of n(logn V Y^log(l/p)), then, 
with high probability, for any spanning tree of the complete graph there exists s G 5"^“^ such 
that r(l^, s, 4’“^(1 — p)) contains that spanning tree. 


Proof of Theorem |4.l| part (iii). 

Let c > 1, e G (0,1), and assume that d < {1 — e)(c — 1) log n/log log n. Let E be the event that 
r(X„, t, s) is disconnected for some s G 5'^“^. Let be a minimal jy-cover of for p G (0,1] to 
be specified below. Then 

EC U , 

where Eg is the event that the graph ns' ||s-s'||<y is disconnected. Let c' = c— (c — l)e/2. 

Note that 1 < c' < c. It follows from the second half of Lemma |5.4| that there exists a constant 
K > 0 such that if p = k;(1 — d/c)/{tVd), then 

IPj-Fs} < IP {G(n, c'logn/n) is disconnected} < (1 + o(l)) , 


14 




















where the second inequality follows from standard estimates for the probability that a random 
graph is disconnected, see Palmer m Section 4.3]. Bounding jC^j by Lemma [Klj and using the 
fact that t = -v/ 2 TogTi(l + o(l)), we obtain that 

1P{^} < |C^|n^-"'(l + o(l)) 

as desired. 


Proof of Theorem |4.l| part (iv). 

Recall that p = clogre/n for c > 1 fixed, and that t = <l)“^(p). Let 0 < e < 1, and assume that 
d > (2 + e)(c — 1) log n/log log n. Define 9 G (0,7r/2) by 6 = (logn)“^/(^+^), so that log(l/0) = 
loglogn/(2 + e). Let "P be a maximal 0-packing of that is, V C is a set of maximal 

cardinality such that for all distinct s,s' € V we have (s, s') < cos 9. By Lemma 

|p| > ^0-id-i) 

' ' - 16 

It suffices to prove that for some s G P, r(X„, s,t) contains an isolated vertex. 

For each s G P, we write the number of isolated vertices in r(Xnj s, t) as 

n 

where Zij{s) equals 1 if {i, j} is not an edge in r(X„,s,t) and is 0 otherwise. We use the second 

moment method to prove that N Yls&v ^ ^ with high probability. This will establish the 
assertion of part (iv) since if A/' > 0 then there is s G such that r(Xn,, s, t) contains an isolated 
vertex. 

To show that > 0 with high probability, by the second moment method it suffices to prove 
that EA^ —)■ oo and that E[A^^] = (EA^)^(l + o(l). First, 

EA^ = jPj • n • E {vertex 1 is isolated in G{n,p)} = jPj • n(l — 

The lower bound on jPj and the inequality 1 — p < e~^ = together imply 

EA^ > — 

- 16 

which tends to infinity by our choice of 9. We now turn to the second moment. 


5.2 we have that 


= E E n z,,k{s)z^^,{s '). 

SjS'GV ijS[n] 

When s = s', separating the inner sum into diagonal and off-diagonal terms yields the identity 

n ^i,k{s)Z,,e{s) = nil-pr-^ + n{n - 1)(1 = n(l -p)-i • [1 + (n - 1)(1 -p)"-^] . 

i,j k^i,l^j 
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Let q = supg^gigg/^'pF{Zij{s)Zij{s') = 1} be the greatest probability that an edge is absent in 
both r(X„, s, t) and r(X,i, s', t). Then when s / s', the inner sum is bounded by 

+ n{n — 1) ■ q ■ {I — 


Combining these bounds, we obtain that 

E[N^] < \V\n{l-p)^-^ ■[l + {n-l){l-p)^-‘^] + \V\{\V\-l)-[nq^-^+n{n-l)-q-{l-pf'^-^]. 


The first term on the right is at most EiV(l + EA^/[(1 — p)|7^|]). The second is at most 


2^2/. _„N2(n-l) . i 


|iP|V(l-p) 




(1 -p)^ 


n—1 


+ 


{i-pY 


= {enY 


Q( 


(1-pY 


n—1 


+ 


(1-pY 


We will show below that q < (1 — p)^ ■ (1 + o(p)). Assuming this, the upper bounds on the two 
terms on the right together give 


E[iV2] 

[EiV] 


2 < 


EiV V ^ (1 -p)|iP|y ^ ^ n 


1 


as required. 

To prove the bound on q, fix s, s' G "P such that q = P{Zij(s)Zij(s') = 1}. Using the definition 
of Zij{s) and Zij{s'), we have 


q = F{{iJ} ^T{Xn,s,t),{i,j} ^r{Xn,s',t),} . 


We may apply Lemma 


5.5 


to this quantity, noting that in our case 9 = t = 0{V\nn) 

and ln(l/tp) = (1 + o(l)) Inn P> 0“^. This means that the Remark after the statement of the 
Lemma applies, and this gives precisely that q < {1 — pY (1 + o{p)), as desired. □ 


5 Appendix 

Here we gather some of the technical tools used in the paper. In the first section we summarize 
results involving covering and packing results of the unit sphere that are essential in dealing with 
the random graph process Gd^i/ 2 {Xn)- In Section 5.2 we describe analogous results needed for 


studying Gd,p{Xn) for small values of p. These lemmas play an important role in the proof of 
Theorem 4.1 Finally, in Section [5.3| we collect some results on G{n,p) random graphs needed in 


our proofs. 

5.1 Covering and packing 

Let B{a,b) = — tY~^dt be the beta function, and let Ix{a,b) be the incomplete beta 

function, 


Ix{a,b) = 




B{a,b) 


For a G [0, vr] and s G let 


Ca(s) = {s' G ^ : {s,s') > cos a} 
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be the cap in ^ consisting of points at angle at most a from s. For a < 7112 the area of this cap 
(see, e.g., [lO!) is 

|C„(<i)| = . (1) 

We use the following standard estimate of the covering numbers of the Euclidean sphere (see, 
e.g., m Lemma 13.1.1]). 

Lemma 5.1. For any rj G (0,1] there exists a subset Crj of S'^~^ of size at most ( 4 / 77 )'^ such that 
for all s G 5^“^ there exists s' G Cjj with jjs — s'jj < 77 . 

We now provide a rough lower bound on the number of points that can be packed in S'^~^ while 
keeping all pairwise angles large. 

Lemma 5.2. For any 9 G (0,7r/2) there exists a subset V 0 of S'^~^ of size at least 


d 

16 


9- 


(d-i) 


such that for all distinct s, s' G Ve we have (s, s') < cos0. 


Proof. First note that it suffices to consider 9 < 1/2 because otherwise the first bound dominates. 
Consider N independent standard normal vectors Xi, ..., Xn- Then Ui = Xi/\\Xi\\ {i = 1,, N) 
are independent, uniformly distributed on Let 


N 

^ ~ |(t/i,U'j>|<cos(0)}. 

i=\ 


Denoting P{| (?7j, Uf) \ > cos(0)} = (j), 

EZ = A^(l - > iV(l - (pN) > N/2 


whenever cpN <1/2. Since Z < N, this implies that 


N 

4 


EZ - N A 1 
> - 1 — > - 

- N - N A - 3 


and therefore there exists a packing set A of cardinality |j 4| > A^/4 as long as (pN < 1/2. To study 
(p, note that 







where Y = (Yi,..., Y^), Y' = (Y /,..., Y/) are independent standard normal vectors. By rotational 
invariance, we may replace Y' by (||y'||,0,...,0), and therefore 

(p = I 

= F{B <cos'^{9)} 

(where i? is a Beta(l/2, (d — l)/2) random variable) 

29"^-^ 

> - . 

- d-1 


The result follows. 


□ 
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Figure 1: Since cos(a) = r//2 = — x'^/rj, the height of the spherical cap that only includes 

points at distance at least rj from the equator is 1 — x = 1 — — ri‘^/2. 


The next lemma is used repeatedly in the proof of Theorem 2.1 and|3.1 


Lemma 5.3. Fix s' G 5"^“^ and rj G (0,1] and assume that d > 12. The probability that there exists 
s G with IIs — s'II < f] such that vertex 1 and vertex 2 are connected in T{Xn,s) but not in 

r(X,i,s') is at most 

rd 

"V2vr • 

Proof. Without loss of generality, assume that s' = (1,0,..., 0). Observe that the event that there 
exists s' G with ||s — s'|| < rj such that vertex 1 and vertex 2 are connected in r(W„,s) but 
not in r(X„, s') is equivalent to Xi^ 2 /||^i, 2 || having its first component between — ri‘^/2 and 

0 (see Figure [^. Letting Z = (Zi,..., Zd) be a standard normal vector in W^, the probability of 
this is 


Ill’ll 


G - 


W2,0) 


< 


A. 

Il^ll 


G (-r?,0) 


-F{B < r/2} 

(where is a Beta(l/2, {d — l)/2) random variable) 


< 


-V(l/2,(d-l)/2) 

1 r 

2Bil/2,id-l)/2)Jo 

V 


X 




.X 


2B{l/2,{d-l)/2) 
Id-l 

< rj\l —— • n 


27r 
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5.2 Auxiliary results for Grfp(A„) 

In this section we develop some of the main tools for dealing with the random graph process 
Gd^p{Xn)- We assume throughout the section that 


p := 1 — <h(t) < 


1 


( 2 ) 


Recall from the start of Section 5.1 that Ca{s) denotes the spherical cap consisting of all unit 
vectors with an angle of < a with s. We will use the following expressions for Cq(s): 


Ca { s ) = {s'G ^ : ||s — s'lp < 2 (1 — cos a)} 

= {scosO + wsinO : w € S'^~^ n {v}^ , 0 < 9 < a}. (3) 


We are interested in studying the graphs T{Xn, s',t), for all s' G simultaneously. 

Lemma 5.4. There exists a constant c > 0 such that, for all £ G (0,1/2), if t > 0 and p are as in 

m, 

£ 


TT 


0 < a < —, tana < — ,_, 

“ “ 2 ’ 

then, for some universal c> 0, if we define £':=£ + c (e^ + e/V 1)), 

1. the union r+ := Us'eCQ(s) stochastically dominated by G{n, (1 + fi) p); 

2. the intersection T- := r(A„, s', t) stochastically dominates by G{n,(l — £')p). 

Proof. The first step in this argument is to note that the edges of both r_|_ and r_ are independent. 
To see this, just notice that, for any {i,j} G (^ 2 ^), the event that {i,j} is an edge in r± depends 
on Xn only through Xij. More specifically, 

{i,j}Gr+ 44^ 3s'G Ca(s) : (Ajj,s') > t; 

{bj} ^ T- ^ Ca{s) : {Xij,s) > t. 


The main consequence of independence is that we will be done once we show that 

(1 -£')p< P{{i, j} G r_} < P{{i, j} G r+} < (1 + e') p. (4) 

As a second step in our proof, we analyze the inner product of Xij with s' = s cos 9 + w sin 9 G 
Ca{s) (with the same notation as in (|^). Note that 

(s', Xij) = N cos 9 + (^w, X^j^ sin 9 = cos 9 + (w, X:^j^ tan 9^ , 

where N := {Xij,s) and X^-j is the component of Xij that is orthogonal to s. Crucially, the 
fact that Xij is a standard Gaussian random vector implies that A is a standard normal random 
variable and X^j is an independent standard normal random vector in s"*". Moreover, 

Vw G (^w,X^j^ I < X := \\Xi;j\\. 

Since “0 1 —)• tan0” is increasing in [0, a], we conclude 

Vs'G Cq(s) : (s', Ajj) = cos0 (a + A(s')) , where |A(s')| < (tana)x. (5) 


19 





Our third step is to relate the above to the events G r±}. On the one hand, 


{i,j} G r+ max > t 

s'^Ca (s) 

N + max A(s') > t (use ® and 0 < cos0 < 1) 

s'eCcis) ^ 

A > t — (tana) X, 

and we conclude (using the independence of N and x) that 

P{{i, j} G r+} < 1 - E[$(t - (tana)x)]- 


Similarly, 


{i,j} G r_ min (s'jXij) > t 

s'GCct{s) 


t 


N + min A(sO > 

s'eCais) cos a 

t 

<^= N > -h (tan a) x, 

cos a 


(by @ and cos 6 > cos a > 0) 


and we conclude 


i,j} G r_} > E 


1 - $ 


cos a 


+ (tana) x 


The remainder of the proof splits into two cases, depending on whether or not 


(6) 


( 7 ) 


e^(l_$(t))>l (8) 

Note that this condition holds if and only if t > C for some C > 0, as 1 — <h(t) = 

5^2 

when t —)• +00 and e”^ (1 — ^>(t)) = 1/2 < 1 when t = 0. 


Last step when is violated. In this case t is bounded above, so p > cq for some positive 
constant cq > 0. We combine ([^ and Q with the fact that ^{t) is (27r)“^/^-Lipschitz. The upshot 
is that 

|i-^(t)-P{{b/}er±}| < ^ 

V TT 

Now X is the norm of a d — 1 dimensional standard normal random vector, so E[x] < -y/E[x^] = 
\/d — 1. The choice of a implies: 


1 - 


cos a 


t + E[x] tana. 


1 

1 - 

cos a 


O(sina) = O 


d-1 


and tana < 


Vd^' 


So 


l-cb(t)-P{{i,j} Gr±}| < 



(ce^ + e) < 


e + c 



for some universal c > 0. 


P 
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Last step when ^ is satisfied. We start with Q and note that we can apply Lemma 5.6 with 
r := t and 


h:= ( — 

\cos a 

After simple calculations, this gives 

p{{i,j}Gr_} 


— 1 ] t + (tan a) X < 0((tan a)^) t + (tan a) y. 


> E [exp (-X)], 


1 - m 


where 


X := 0{{tana)^) {t^ + 1) — {t + t ) (tana)x — (tana)^ — 0((tana)^)t^. 

By Jensen’s inequality, E[e“^] > Since E[x]^ < E[x^] = d — 1 and tana = e/t\/d — 1 in 


this case, 


E[A] < O 


d-l 


+ (1 + 0(e + t ^)) £. 


In other words, if we choose c > 0 in the statement of the theorem to be large enough, we can 
ensure that 

p{{i,j} Gr_} 


1 - $(t) 




We now turn to Q. Applying Lemma 5.6 below with r := t — xtana when r > t/2, we get 

1 -$(t- (tana)x) < e(*+f) f""" (1 -^(i))- (9) 


In fact, the same inequality holds when r < t/2, i.e., (tana) x > i/2, for in that case the right-hand 

5t2 n n 

side is > e^ (1 — <h(i)) > 1 (recall that we are under the assumption (8)). So (9) always holds, 
and integration over x gives 


^ < ]Erg(*+f) 2 ’ 1 

l-$(t) “ ^ 


( 10 ) 


It remains to estimate the moment generating function on the right-hand side. The first step is to 
note that, since E[^] is the norm of a d — 1 dimensional standard normal vector, E[x] < E[x^]^^^ = 
\/d — 1. So by Cauchy Schwartz, 

e~(*+f) E[e^*'*'^) 2 ^ ^ ] < E[e(*'*~^) 2 ^ ^ ] 

< Y^E[e1(X-EM)]]E[e(tan a)2 ]. ( 11 ) 

Next we estimate each of the two expectations on the right-hand side of the last line. In the first 
case we have the moment generating function of x ~ E[x], where x is a 1-Lipschitz function of a 
standard Gaussian vector. A standard Gaussian concentration argument and our definition of a 
give 

/ /I \ 12^+41 (tano:)^ 

E[e(2*+i) (x-EM)] < -- < i + ^^^2 
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for some universal constant cq > 0. The second tern in 0 is the moment generating function of 
X^, a chi-sqnared random variable with d — 1 degrees of freedom. Since (tana)^ < ^jid— 1) < 1/2 
under onr assumptions, one can compute explicitly 


E[e 


(tan afx^] — 


dl2 


^ 1 T Cq £ 


1 — 2 (tan a )2 

for a (potentially larger, but still universal cq > 0). Plugging the two estimates back into 0 , we 
obtain 

]E[e(t+f)(tana)x+ ^*^" 2^"^" ] < e(*+f ) (1 + Cq £^), 

and the fact that t {tan a)\/d — 1 = e implies that the right-hand side is < 1 -|- e -)- c{t~‘^e + £^) 
for some universal c > 0. Going back to (10) we see that this finishes our upper bound for 

P{{i,j}Gr+}. □ 


Correlations between edges and non-edges 

In this case we consider s, s' G 5"^“^ and look at correlations of “edge events.” 
Lemma 5.5. For any t > 1, 0 < 0 < n, define 

(1 — cos 9)“^ 


^ ;= 1 — COS0, 7 := 


sm ( 


Then there exists a universal constant C > 0 such that for s, s' £ S'^ ^ such that {s, s') < cos 6, we 
have 

F{{Xij,s) > t, {Xij,s') > t} <p[{Cptf^^^^ + 


( 12 ) 


F{{Xij,s) <t, {Xij,s') <t} <l-2p + p[{Cpt)^^~^^^ Fe''^^ 2 p] 


Remark, (nearly equal vectors.) Suppose p = o(l) and 9 = o(l). One may check that 
7 = (1 -|-o(l)) 9^/4 and f = (1 -|-o(l)) 0^/2. This means that if 9^ t^ = o(ln(l/p)) and 9“^ \n{l/tp) = 
a;(l), then 

F{{Xij, s) < t, {Xij,s') <t} <l-2p + o{p) = (1 - pf (1 o{p)). 

This is nsed in the proof of Theorem |4.1[ part (iv) above. 


Proof. We focus on the inequalities in (12), from which the other inqualities follow. For convenience, 
we write rj := cos 9 and note that 


r/ = l — so 7 = l — 


1 - (l+Qrj 


Tj^ 


(13) 


Moreover, 0 < 7 < 1: the first inequality is obvious, and the second follows from the fact that 
^ TT (1 — COS0)^ (1 — COS0) (1 -|- COS0) 1 — COS^0 

0 < 0 < - ^ 0 < 7 = ^^ = sm0 < 1. 

2 sm 9 sm 9 sm 9 


Let E denote the event in (12). The properties of standard Gaussian vectors imply 


p{f;} = p({iVi > t} n {7? iVi + 7/1-7/^ N2 > t}) 
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where A^i,A ^2 are independent standard normal random variables. In particular, we can upper 
bound 


IP{^} < IP{iVi > (1 + 0 >t}r<N2> 




t 


rj^ 


(14) 


The first term in the right-hand side is l — <h(t + ^t) < e ^2 (1 — <I)(t)) = e ^ 2 ^ *^(1 —<h(t)) 


by Lemma 5.6 The fact that 


,i.n. = 1 , 


t->-+oo g 

implies that, for t > 1, the ratio is bounded by a C t, C > 0 a constant. We conclude 

> (1 + 0^} < P < p {C t 


As for the second term in the right-hand side of (14), we apply Lemma 5.6 with 


r := 


Vi - 


= (1 — 7 ) t and h ■.= ^t. 


We deduce: 


N 2 > + 4 = 1- (1 - $(t)). 




The proof finishes by combining the estimates for the right-hand side of (14). 
Lemma 5.6. //e G (0,1/2), r > 0 and h > 0, 

< l-^{r + h) 


1 - <l)(r) 

Proof. We first show the upper bound, namely: 


< e 


-hr-^ 


Vr, h > 0 : 1 — <h(r + h) < e ^(1 — ^(j’))- 


To see this, we note that: 


1 — $(r -|- h) = 


< 


L 

L 

i: 


_ (x + r+h) 

^ e 2 




dx 


. (x+r)'^ 

.+00 g-^- 2 ^ 






. (cc + r)^ 

+00 p- ^ 2 h ‘2 

" e-^^-^dx 


/o 

[1 — 4>(r)] e 


-rh-V 


(15) 


□ 


(16) 
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To continue, we go back to the formula 


1 — $(r + h) = 


+00 g-(a;+r)h 




dx \ e 2 


which is clearly related to 


In fact, inspection reveals that 


/■ + 00 g-^ 

<l>(r) = / - ,_ dx. 

Jo 


(^+r) 


1 - 


1 - <h(r) 

Using Jensen’s inequality, we have 


1 — <I>(r + h) -h V I 1 

^ '= e 2 E[e \ N > r]. 


1 - <h(r + h) h? hE[N\N>r] 

1 - $(r) - ^ ^ 


and (16) means that F{N — r>t\N>r}<e so E[A^ | -/V > r] < r + ^. We deduce: 

1 — <I>(r + h) 




1 — <k(r) 


> e 2 e 


as desired. 


□ 


5.3 Random graph lemmas 

Here we collect some results on random graphs that we need in the arguments. In the proof of 


Theorem 2.1 we use the following lower tail estimate of the clique number of an Erdos-Renyi random 
graph that follows from a standard use of Janson’s inequality. 

Lemma 5.7. Let Nk denote the number of cliques of size k of a G{n, 1/2 —an) Erdos-Renyi random 
graph where 0 < «„ < 1/n and let 6 > 2. Denote to = 21og2n — 21og2log2n + 21og2e — 1. If 
k = [cj — JJ, then there exists a constant C such that for all n, 


’{iVfc = 0} < exp 


/ -C'n^ 

V (log2 n)^ 


Proof. Write p = 1/2 — and define ojp = 2 logj^/p n — 2 log;^/^ logi/p n -|- 2 log;^/p(e/2) + 1. We use 
Janson’s inequality ([9l Theorem 2.18]) which implies that 


’ {Nk = 0} < exp 


-mkf 

A 


where EA^ = and 
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To bound the ratio A/(EA^fc)^, we may repeat the calculations of Matula’s theorem on the 2-point 
concentration of the clique number ([l2])j as in Palmer m Section 5.3]. 

Let j3 = log;^/p(31og;^/pn)/log]^/pn and define m = [/3A:J Then we split the sum 


A 


(EAfc)2 

To bound the first term, we write 


k (k\ (n-k\ m—1 (k\ (n-k\ 

E + E 

j=m j=2 


^ (j) C-j) -m _ ^("t) 
2- © 


j=m 


EAfc 


where F{m) = Yl^=m (j) (fc-j)P ( 2 )+( 2 ). Now if /c = [wp — (5J for some 5 G (0,Wp), then the 
computations in Palmer m pp.77-78] show that 




which is bounded whenever 


kn^{l/p) 


= 0 ( 1 ). 


p-k{l+0)/2 

This is guaranteed by our choice of /? = log]^/p(3 log^/p n )/ log^/p n. Hence, the first term is bounded 
by 

^=0(l)VSp“/^ 

For the second term, once again just like in [H], note that 

m—1 (k\ fn—k\ 

j=2 ykJ 




i=2 

m—1 


< 


< 


0 ( 1 ) E 


kp 


—m/2 


n 


1=2 

”^-Y2(logi/,n)^y 


< O 


0(1) E 

j=2 \ 
f (logi/pu) 
n2 


n 


Putting everything together, we have that there exist constants C, C such that for k = [wp — 5], 


’{Afc = 0} < exp -C 




< exp 


-(E'n2 


((log2n)V 


whenever h > 2. Noting that Wp = a; + 0{an logn) completes the proof. 


□ 
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Part (iii) of Theorem |3.1| crucially hinges on the following interesting result of Alon and Sudakov 
[2] on the “resilience” of the chromatic number of a G(n, 1/2) random graph. The form of the 
theorem cited here does not explicitly appear in [2] but the estimates for the probability of failure 
follow by a simple inspection of the proof of their Theorem 1.2. 


Proposition 5.8. ([21 Theorem 1.2]). There exist positive constants ci,C 2 such that the fol¬ 
lowing holds. Let e > 0 and let G be a G(n, 1/2) random graph. With probability at least 
1 — exp(cin^/(logn)^), for every collection E of at most C 2 e^n^/(log 2 n)^ edges, the chromatic 
number of G U E is at most (1 + e)n/(2 log 2 n). 


The final lemma is used in proving part (i) of Theorem 4.1 


Lemma 5.9. Fixc^ (0)1)- With p = c\ogn/n, let N be the number of isolated vertices in G{n,p). 
Then for n large, F{N = 0) < exp(—n^“'^/3). 


Proof. The following approach is borrowed from O’Connell m- Fix q = 1 — y/1 — p and let D{n, q) 
be the random directed graph with vertices [n] in which each oriented edge ij appears independently 
with probability q. Write I for the number of vertices of D{n, q) with no incoming edges, and M for 
the number of isolated vertices in D{n, q), with no incoming or outgoing edges. Then M and N have 
the same distribution. Next, observe that I has law Bin (n, (1 — q)^~^) = Bin (n, (1 — . 

Furthermore, conditional on /, 


M = Bin (^I, . 

It follows that 

F{N = 0) = P(M = 0) 

< P(|I-E/| > E//2) + sup E(Bin(fc,(l-p)(”"'')/2) =0). (17) 

A:e(l/2)E/,(3/2)E/ ' 

For the first term, a Chernoff bound gives 

E(|/ - E/| > E//2) < ^ 

where the last inequality holds since (1 — = (1 -|- o(l)n“‘’/^. Next, fix k as in the above 

supremum. For such k we have p{n — k) = clogn + 0{logn/n^^^). Using this fact and that 
2 

1 — p > e~^~^ for p small yields 

P(Bin (fe, (1 - = 0) = (1 - (1 - p)(^-U/2)fc 

<exp(-A:(l-p)(’^-^)/2^ 

= exp 

= exp (^—(1 + o(l))A:n“‘’/^^ . 

2 

Using that 1 — p > e~^~P a second time gives 

k > E//2 = n(l -p)("-i)/V2 > (1 + o(l))ne-”^’/V2 = (1 + o(l))ni-"/V2. 


The two preceding inequalities together imply that 

E(Bin ^/c, (1 — = 0) < exp (—(1/2 + o(l)) • . 

Using this bound and ([T^ in the inequality ([Xt]), the result follows easily. □ 
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